Using context and phonetic features in models of etymological sound change

نویسندگان

  • Hannes Wettig
  • Kirill Reshetnikov
  • Roman Yangarber
چکیده

This paper presents a novel method for aligning etymological data, which models context-sensitive rules governing sound change, and utilizes phonetic features of the sounds. The goal is, for a given corpus of cognate sets, to find the best alignment at the sound level. We introduce an imputation procedure to compare the goodness of the resulting models, as well as the goodness of the data sets. We present evaluations to demonstrate that the new model yields improvements in performance, compared to previously reported models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Web-Based Interactive Tool for Creating, Inspecting, Editing, and Publishing Etymological Datasets

The paper presents the Etymological DICtionary ediTOR (EDICTOR), a free, interactive, web-based tool designed to aid historical linguists in creating, editing, analysing, and publishing etymological datasets. The EDICTOR offers interactive solutions for important tasks in historical linguistics, including facilitated input and segmentation of phonetic transcriptions, quantitative and qualitativ...

متن کامل

Structure-preserving sound change: a look at unstressed vowel syncope in Austronesian

Over the course of the past several hundred years, advances in our understanding of sound change along with 20th century advances in phonetic science and phonological typology have given rise to a new landscape of sound patterns. A particular sound change in a particular language forms part of a population of similar sound changes with similar phonetic bases. Looking at this population, we can ...

متن کامل

MDL-based Models for Alignment of Etymological Data

We introduce several models for alignment of etymological data, that is, for finding the best alignment, given a set of etymological data, at the sound or symbol level. This is intended to obtain a means of measuring the quality of the etymological data sets, in terms of their internal consistency. One of our main goals is to devise automatic methods for aligning the data that are as objective ...

متن کامل

Probabilistic Models for Alignment of Etymological Data

This paper introduces several models for aligning etymological data, or for finding the best alignment at the sound or symbol level, given a set of etymological data. This will provide us a means of measuring the quality of the etymological data sets in terms of their internal consistency. Since one of our main goals is to devise automatic methods for aligning the data that are as objective as ...

متن کامل

A Review of Spatial Factor Modeling Techniques in Recommending Point of Interest Using Location-based Social Network Information

The rapid growth of mobile phone technology and its combination with various technologies like GPS has added location context to social networks and has led to the formation of location-based social networks. In social networking sites, recommender systems are used to recommend points of interest (POIs) to users. Traditional recommender systems, such as film and book recommendations, have a lon...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012